Combining Independent Knowledge Sources for Word Sense Disambiguation
نویسندگان
چکیده
Disambiguation Yorick Wilks and Mark Stevenson Department of Computer Science, University of She eld, Regent Court, 211 Portobello Street, She eld S1 4DP, UK fyorick, [email protected] Abstract Sense tagging, the automatic assignment of the appropriate sense from some lexicon to each of the words in a text, is a specialised instance of the general problem of word sense disambiguation. We discuss which recent word sense disambiguation algorithms are appropriate for sense tagging. It is our belief that sense tagging can be carried out effectively by combining several simple, independent, methods and we include the design of such a tagger. A prototype of this system has been implemented, correctly tagging 88% of polysemous word tokens in a small test set, providing evidence that our hypothesis is correct. 1 Sense Tagging There has been a tendency in word sense disambiguation (WSD hereafter) literature to carry out di erent disambiguation tasks and classify them all as WSD procedures. There are however, at least, three di erent levels of algorithm in WSD. The most speci c of these are sense tagging procedures, these assign, to each word1 in a text, its particular sense from some lexicon and each word type having a set of senses speci c to it. This di ers from the more general case of semantic tagging, where the tags for each word (type) need not be speci c to that type and do not correspond to word senses in a lexicon. These tags may be broad semantic categories such as HUMAN or ANIMATE or WordNet synsets which may apply to any word in the text (similar to part of speech tagging in the sense that there is a class of tags which may apply to any token in the text). The most general class of algorithms are semantic disambiguation algorithms. These algorithms are procedures which carry out semantic disambiguation on words, these may not necessarily be tagging algorithms, in that they do not attempt to mark every token in a text but may be restricted to disambiguating small sets of word types. The class of sense tagging algorithms is a proper subset of the class of semantic tagging algorithms, which is, in turn, a proper subset of the class of semantic disambiguation algorithms. This hierarchical relationship is represented in Figure 1. 1This is often loosened to each content word. Semantic Tagging Tagging Sense Semantic Disambiguation
منابع مشابه
Word Sense Disambiguation using Optimised Combinations of Knowledge Sources
Word sense disambiguation algorithms, with few exceptions, have made use of only one lexical knowledge source. We describe a system which performs unrestricted word sense disambiguation (on all content words in free text) by combining different knowledge sources: semantic preferences, dictionary definitions and subject/domain codes along with part-of-speech tags. The usefulness of these sources...
متن کاملCombining Weak Knowledge Sources for Sense Disambiguation
There has been a tradition of combining different knowledge sources in Artificial Intelligence research. We apply this methodology to word sense disambiguation (WSD), a long-standing problem in Computational Linguistics. We report on an implemented sense tagger which uses a machine readable dictionary to provide both a set of senses and associated forms of information on which to base disambigu...
متن کاملCombining Knowledge- and Corpus-based Word-Sense-Disambiguation Methods
In this paper we concentrate on the resolution of the lexical ambiguity that arises when a given word has several different meanings. This specific task is commonly referred to as word sense disambiguation (WSD). The task of WSD consists of assigning the correct sense to words using an electronic dictionary as the source of word definitions. We present two WSD methods based on two main methodol...
متن کاملCombining Supervised and Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation
This work combines a set of available techniques – which could be further extended – to perform noun sense disambiguation. We use several unsupervised techniques (Rigau et al., 1997) that draw knowledge from a variety of sources. In addition, we also apply a supervised technique in order to show that supervised and unsupervised methods can be combined to obtain better results. This paper tries ...
متن کاملCombining ConceptNet and WordNet for Word Sense Disambiguation
Knowledge-based Word sense Disambiguation (WSD) methods heavily depend on knowledge. Therefore enriching knowledge is one of the most important issues in WSD. This paper proposes a novel idea of combining WordNet and ConceptNet for WSD. First, we present a novel method to automatically disambiguate the concepts in ConceptNet; and then we enrich WordNet with large amounts of semantic relations f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997